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Genetic Evidence of a Long-Range RNA-RNA Interaction between the 
Genomic 5’ Untranslated Region and the Nonstructural Protein 1 
Coding Region in Murine and Bovine Coronaviruses 


Bo-Jhih Guan,? Yu-Pin Su,? Hung-Yi Wu,>* and David A. Brian?” 


Departments of Microbiology? and Biomedical and Diagnostic Sciences,’ University of Tennessee College of Veterinary Medicine, Knoxville, Tennessee, USA 


Higher-order RNA structures in the 5’ untranslated regions (UTRs) of the mouse hepatitis coronavirus (MHV) and bovine 
coronavirus (BCoV), separate species in the betacoronavirus genus, appear to be largely conserved despite an ~36% nucle- 
otide sequence divergence. In a previous study, each of three 5’-end-proximal cis-acting stem-loop domains in the BCoV 
genome, I/II, II, and IV, yielded near-wild-type (wt) MHV phenotypes when used by reverse genetics to replace its coun- 
terpart in the MHV genome. Replacement with the BCoV 32-nucleotide (nt) inter-stem-loop fourth domain between stem- 
loops III and IV, however, required blind cell passaging for virus recovery. Here, we describe suppressor mutations within 
the transplanted BCoV 32-nt domain that along with appearance of potential base pairings identify an RNA-RNA interac- 
tion between this domain and a 32-nt region ~200 nt downstream within the nonstructural protein 1 (Nsp1)-coding re- 
gion. Mfold and phylogenetic covariation patterns among similarly grouped betacoronaviruses support this interaction, as 
does cotransplantation of the BCoV 5’ UTR and its downstream base-pairing domain. Interestingly, cotransplantation of 
the BCoV 5’ UTR and BCoV NspI1 coding region directly yielded an MHV wt-like phenotype, which demonstrates a cog- 
nate interaction between these two BCoV regions, which in the MHV genome act in a fully interspecies-compliant manner. 


Surprisingly, the 30-nt inter-stem-loop domain in the MHV genome can be deleted and viral progeny, although debili- 
tated, are still produced. These results together identify a previously undescribed long-range RNA-RNA interaction be- 
tween the 5’ UTR and Nsp1 coding region in MHV-like and BCoV-like betacoronaviruses that is cis acting for viral fitness 
but is not absolutely required for viral replication in cell culture. 


I positive-strand RNA viruses that replicate in the cytoplasm, 
genomic 5'-end-proximal RNA structures carry out several func- 
tions required for virus reproduction. In coronaviruses, these are 
thought to include (i) translation initiation, commonly presumed to 
occur by a canonical cap-dependent 5’-terminal ribosomal entry 
mechanism, to synthesize the replicase enzymes from open reading 
frame 1 (13, 14, 31); (ii) signaling an RNA-dependent RNA polymer- 
ase template switch during minus-strand synthesis at a heptameric 
transcription-regulatory sequence (UCUAAAC in the case of 
mouse hepatitis coronavirus [MHV] and bovine coronavirus 
[BCoV]) (Fig. 1A and B) for placement of a common leader on 
subgenomic mRNAs (sgmRNAs) (43, 48, 49, 55); (iii) encoding 
signals on the 3’ end of minus-strand genomic RNA (the antig- 
enome) for initiating synthesis of plus-strand genomic mRNAs 
and sgmRNAs (9, 43, 48, 55); (iv) possibly harboring signals that 
act in trans to initiate synthesis of nascent plus-strand genomes 
(46); (v) possibly directly influencing initiation of minus-strand 
synthesis at the 3’ end of the genome (33); and (vi) harboring a 
genome packaging signal (10, 18). A mechanistic understanding 
of these events may aid in the development of therapeutic inhibi- 
tors of coronavirus replication. 

In a previous study (22), we exploited the ~36% nucleotide 
(nt) sequence divergence between the 5’ untranslated regions 
(UTRs) of BCoV and MHV, separate species in the betacoronavi- 
rus genus (15) (formerly classified as group 2a coronaviruses), to 
identify potentially common 5’-end-proximal cis-acting elements 
for replication of MHV. Despite the relatively high nucleotide 
sequence divergence, higher-order structures in the 5’-end-prox- 
imal genomic regions of these two viruses are largely conserved 
(Fig. 1A and B) (7, 10, 29, 33, 35, 36, 41, 42), although the nomen- 
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clature is not yet uniform, since stem-loops (SLs) II, III, and IV in 
Fig. 1 are respectively named stem-loops 3, 4 and 5 by the Leibow- 
itz, Giedroc, and Olsthoorn laboratories (10, 29, 33, 35, 36). It was 
learned that the BCoV SLI/II domain (nt 1 to 84), which contains 
higher-order structures named SLI (now known to be comprised 
of stem-loops 1 and 2) (29, 33, 35, 36) and SLH, can functionally 
replace its MHV counterpart, as can the SLII domain (nt 85 to 
141) and the SLIV domain (nt 174 to 226). For function, SLIV 
required a 16-nt extension into the nonstructural protein 1 
(Nsp1) coding region to form a “long” SLIV (Fig. 1A and B). In 
each of these substitutions, despite nucleotide differences of 37, 
22, and 33%, respectively, the chimeric virus replicated for at least 
three passages without compensatory suppressor mutations ap- 
pearing within the 5’-most 1,000 nt and 3’-most 500 nt of the 
genome. In contrast, a 30-nt inter-stem-loop domain in the MHV 
5’ UTR that resides between SLIII and SLIV (nt 141 to 170) could 
not be replaced by the 32-nt BCoV counterpart (nt 142 to 173) to 
directly yield viable virus after transfection of the chimeric RNA 
genome (22). However, the resulting chimera, BM?173/M, pro- 
duced plaques and viable progeny after two blind cell passages in 
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one of four experimental attempts. These results suggested that 
there had been compensatory mutations within the surviving chi- 
meras that enabled replication. 

Here, we describe seven potential suppressor mutations that 
appeared within the transplanted 32-nt BCoV inter-stem-loop 
domain and five within a 32-nt partnering domain mapping 
~200 nt downstream within the Nsp1 coding region and dem- 
onstrate the importance of these as cognate domains for repli- 
cation in genetic reconstruction assays. The Nsp1 coding re- 
gion is the first of 16 in the replicase gene (16, 21), and Nsp1 is 
an RNA binding protein with several postulated functions in 
coronavirus replication (5, 6, 23, 27, 28, 32, 39, 47, 56). Sur- 
prisingly, the 30-nt inter-stem-loop domain in the 5’ UTR of 
the MHV genome can be deleted and viral progeny, although 
debilitated, are still produced. The results together identify a 
previously undescribed long-range cis-acting RNA-RNA inter- 
action between the 5’ UTR and Nsp1 coding region in the 
murine and bovine betacoronaviruses that appears to be im- 
portant for viral fitness but is not absolutely required for virus 
replication in cell culture. 


MATERIALS AND METHODS 


Cells. Delayed brain tumor (DBT) cells (24), mouse L2 cells (19), and 
baby hamster kidney cells expressing the MHV receptor (BHK- 
MHVR) (11, 12) were grown in Dulbecco’s modified Eagle’s medium 
(DMEM) supplemented with 10% defined fetal calf serum (FCS) (Hy- 
Clone) and 20 ug/ml gentamicin (Invitrogen). Cells were maintained 
at 37°C with 5% CO, for all experiments. BHK-MHVR cells were 
maintained in selection medium containing 0.8 mg/ml Geneticin 
(G418 sulfate; Invitrogen) (52). 

Viruses and GenBank accession numbers. GenBank accession num- 
bers for virus sequences analyzed are as follows: NC_001846 for MHV- 
A59; NC_006852 for MHV-JHM; NC_012936 for Parker rat CoV; 
NC_006577 for human coronavirus (HCoV)-HKU1; NC_010327 for 
equine coronavirus (ECoV); NC_005147 for HCoV-OC43; NC_007732 
for porcine hemagglutinating encephalomyelitis virus (PHEV); FJ415324 
for human enteric coronavirus-4408 (HECoV-4408); U00735 for BCoV- 
Mebus; and NC_004718 for SARS coronavirus, Toronto strain (SCoV- 
Tor2). 

RNA structure prediction and sequence alignments. The Mfold pro- 
gram of Zuker (http://www. bioinfo.rpi.edu/~zukerm/) (38, 54) was used 
for RNA structure predictions. The long-range RNA-RNA base pairing 
patterns described below were revealed by folding nt 1 to 400 or nt 1 to 
500. Sequence alignments were made with Vector NTI suite 8 software 
(Invitrogen). 

Reverse genetics system used for making BCoV/MHV chimeras and 
methods for analyzing viral progeny. The reverse genetics system for 
MHV-AS9, infectious clone MHV-A59-1000 (icMHV) (developed by R. 
Baric and colleagues [17, 52] and kindly provided by R. Baric), was used as 
previously described in detail for making 5'-end-proximal BCoV/MHV 
chimeras in the MHV-A59 background (22). Recombinant viruses de- 
scribed here were made by modifying fragment A of MHV-AS59 as re- 
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ported previously (22) but with the appropriate primers for the mutations 
described below. All procedures for plasmid construction with icMHV 
DNA, expression of recombinant viral RNA, transfection of cells with 
infectious recombinant RNA by electroporation, and characterization of 
mutant progeny by virus titration and growth kinetics were carried out as 
previously described (22). Plaque morphology was determined on L2 cells 
after 60 h of growth and after crystal violet staining as described previously 
(22). Plaque sizes were identified as large (wild type [wt]) if they were 
=>2.5 mm, medium if 1.5 to 2.5 mm, or small if =1.5 mm in diameter. 
Representative plaque images were taken with a Nikon digital camera and 
prepared using Adobe Photoshop CS software. 

Genome sequence analysis of chimera progeny. Routinely, superna- 
tant fluids from cells that first showed cytopathic effect (CPE) (either cells 
that had been transfected or cells that had been blind-passaged) were 
collected, and virus contained therein was referred to as virus passage zero 
(VPO) virus. When 80 to 100% of new DBT cells infected with VPO virus 
showed CPE, intracellular RNA was extracted with TRIzol (Invitrogen), 
and the viral genome was sequenced by RT-PCR for the 5’-end-proximal 
nt 22 to 1093 and the 3’-terminal 500 nt (excluding the poly(A) tail). VPO 
virus was then used to determine plaque morphology, and plaque-puri- 
fied virus was used as starting material for growth kinetics on DBT cells 
and sequence analyses. 

For analysis of the 5’ nt 22 to 1093 of progeny virus genomes, 
extracted cellular RNA was reverse transcribed with Superscript II 
reverse transcriptase (Invitrogen) using primer MHV-1094(+) (5'CG 
ATCAACGTGCCAAGCCACAAGG3’), which binds MHV genomic nt 
1094 to 1117, and cDNA was PCR amplified with primers MHV- 
leader(—) (5’TATAAGAGTGATTGGCGTCCG3’), which binds nt 1 to 
21 of the MHV antileader, or BCoV-leader(—) (5'GATTGTGAGCGATT 
TGCGTGCG3’), which binds nt 1 to 22 of the BCoV antileader, and 
MHV-1094(+). PCR products were gel purified (QIAEX II; Qiagen) prior 
to automated sequencing with primers MHV(261-284)(—) (5'CCATGG 
ATGCTTCCGAACGCATCG3’) and MHV(605—623)(+) (5’GTTACAC 
AGGCAGACGCGC3’). 

For analysis of the 3’ 500 nt of progeny virus genomes, the same pro- 
cedure was used except that total RNA was reverse transcribed using 
primer DI3(+) (5'CGGGATCCGTCGACACGCGTTTTTTTTTTTTTT 
TTTTTT3’), which binds the poly(A) tail, and the cDNA was PCR ampli- 
fied with primer MHV(30811-30830)(—) (5’GGATGGTGGTGCAGAT 
GTGG3’), which binds the negative strand of virus nt 30811 to 30830, and 
primer DI3(+). Gel-purified PCR products were sequenced with primer 
MHV(30811-30830)(—). 

Northern analysis. Northern analyses were done as previously de- 
scribed (25). Briefly, freshly confluent DBT cells in 25-cm*? flasks (~4 X 
10° cells) were infected with wt or chimeric viruses at an MOI of 0.01 
PFU/cell. At 20 h postinfection, intracellular RNA was TRIzol extracted, 
and 1/10 of the total RNA from one 25-cm? flask (~60 xg RNA total per 
25-cm* flask) was resolved by electrophoresis in a 1.0% agarose-formal- 
dehyde gel at 150 V for 4 h. RNA was transferred to a Hybond N* nylon 
membrane (Amersham Biosciences) by vacuum blotting for 3 h followed 
by UV cross-linking. After prehybridization of the membrane with 
NorthernMax prehybridization/hybridization buffer (Ambion) at 55°C 
for 4 h, the blot was probed at 55°C overnight with 20 pmol (~4 10° 
cpm/pmol) of the 5’-end y-*?P-labeled 3'-UTR-specific oligonucleotide 


FIG 1 5’ UTRs of MHV-A59 and BCoV-Mebus showing the intra-5’-UTR stem-loops, the inter-stem-loop domain, and a potential long-range RNA-RNA 
interaction between the 5’ UTR and the Nsp1 coding region. (A) 5’ UTR of MHV showing the Mfold-predicted long-range RNA-RNA interaction (hatched box). 
(B) 5’ UTR of BCoV showing the Mfold-predicted long-range RNA-RNA interaction (hatched box). (C) 5’ UTR of MHV in which the 32-nt BCoV inter-stem- 
loop domain (lowercase) replaces the 30-nt MHV counterpart. The hatched box identifies a potential long-range interaction and shows a summary of the 
potential suppressor mutations in 17 plaque-purified clones described in Table 1. The subscript for each mutation indicates its incidence among the 17 isolates. 
Note that without suppressor mutations, the number of base pairs within the window is 19, not 27 (as in MHV) or 25 (as in BCoV). Refolding enables the number 
of base pairs to increase as a function of suppressor mutations (see the text). In all panels, the underlined bases in SLII identify the transcription-regulatory 
sequence (UCUAAAC), and boxes in SLIII identify the start and stop codons for a short upstream open reading frame. The box in SLIV identifies the start codon 
for the Nsp1 coding region. Filled circles in SL1, SLU, and SLIII represent non-Watson-and-Crick base pairings, as determined by nuclear magnetic resonance 


studies (33, 36). 
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MHV(31094-31122)(+) (5'CAGCAAGACATCCATTCTGATAGAGA 
GTG3’), which binds MHV genomic nt 31094 to 31122. Probed blots 
were exposed to Kodak XAR-5 film at —80°C for imaging, and images 
were prepared using Adobe Photoshop CS software. 


RESULTS 

Identification of seven mutations within the transplanted inter- 
stem-loop domain and five within the Nsp1 coding domain that 
potentially reconstitute base-pairing between the two domains 
in phenotypic revertants. It was demonstrated previously that the 
30-nt inter-stem-loop domain mapping between SLIII and SLIV 
in the MHV 5’ UTR behaves as a species-specific cis-acting repli- 
cation element, since the 32-nt homologue from the BCoV ge- 
nome was essentially functionally incompatible with the MHV 
genome, as shown by the chimera BY? 173M (Fig. 1C) (22). Nu- 
cleotides 131 to 140, the 5'-terminal 10 nt of the entire sequence 
between SLIII and SLIV, were not included as part of the inter- 
stem-loop domain here or previously (22), because by some pre- 
dictions it makes up part of SLIII in the MHV 5’ UTR (29). Like- 
wise, BCoV nt 130 to 141 were not included as part of the 
transplanted 32-nt inter-stem-loop domain. B!**-'7?/M chimeric 
RNA containing the 32-nt transplanted BCoV domain during 
three transfection experiments was found to yield no viable prog- 
eny after four blind cell passages, but in a fourth trial, a mixture of 
small and wt-like large plaques were found after two blind cell 
passages (Fig. 2A, VPO plaques). From this, six small plaques and 
11 large plaques were selected, and each was passaged 10 times in 
DBT cells. Note that virus causing smaller plaques developed 
larger plaques and near-wt growth kinetics with passaging (Fig. 2). 
To determine what potential suppressor mutations might have 
caused the phenotypic changes within the transplanted 5’ UTR, 
the adjacent Nsp1 coding domain, and the 3’ UTR, viral genomic 
RNA isolated from cells infected at virus passages 1, 6, and 10 was 
sequenced through these regions. No 3’-end-proximal changes 
were found, and nucleotide changes within the 5’-end-proximal 
nt 22 through 1093 are summarized in Table 1. 

The most striking result was a clustering of potential sup- 
pressor mutations within the transplanted 32-nt BCoV domain 
and within a 32-nt region mapping ~200 nt downstream 
within the MHV Nsp1 coding region (summarized in Table 1 
and schematically depicted in Fig. 1C and 3A). It was also strik- 
ing that whereas there were fewer predicted base pairings ini- 
tially between the two regions in B‘4*"'7*/M RNA (19 bp) (Fig. 
1C) than in Mfold-predicted wt MHV RNA (27 bp) (Fig. 1A), 
when predicted foldings included the putative suppressor mu- 
tations at virus passage 10 (Table 1), the number of base pairs 
within the window increased from 19 (Fig. 1C and 3A) to 22 to 
25 (Table 1 and Fig. 3B). For example, when the combined 
suppressor mutations in S] and S2 at VP10 were used, Mfold 
predicted the first potential folding pattern (potential folding 
1) in Fig. 3B. Base pairings of 22 to 25 in revertants compare 
with 27 and 25 bp, respectively, for wt MHV and wt BCoV (Fig. 
1A and B). These observations together led us to hypothesize 
that the suppressor mutations may reveal a specific long-range 
RNA-RNA interaction between the 5’ UTR and the Nsp1 cod- 
ing region that is important for virus replication. 

Mfold predicts base pairing between the inter-stem-loop do- 
main and a region within the Nsp1 coding region of several 
MHV-like and BCoV-like betacoronaviruses. After a potential 
base pairing between the two domains was first noted, as described 
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above, Mfold patterns that might identify base pairings for the 
long-range RNA-RNA interaction were sought by folding nt 1 to 
400 or nt 1 to 500 of additional fully sequenced MHV-like and 
BCoV-like betacoronaviruses. Interestingly, base pairing between 
the inter-stem-loop domains (Fig. 4A) and respective partnering 
domains were readily found for the MHV-like, HCoV-HKUI, and 
BCoV-like coronaviruses (Fig. 4B) that revealed 25 to 27 potential 
base pairings within the 30- to 32-nt window. Figure 4B also de- 
picts the phylogenetic covariation of sequences that maintain base 
pairings within the virus group. The Mfold and covariation re- 
sults, therefore, appear to be consistent with a functional role for 
the base pairing as postulated above. 

In vitro mutagenesis to form base pairing in the B'**"'7*/M 
chimera with C158G, A163U, and G169U (thus recapitulating 
reverted virus S1 at passage 10) enabled immediate wt-like prog- 
eny after transfection of the chimeric genome. To directly test 
the hypothesis that base-pairing between the inter-stem-loop and 
the Nsp1 partnering domains imparts fitness to the debilitated 
chimera, C158G, A163U, and G169U, the three putative suppres- 
sor mutations in isolate $1 at passage 10 (Table 1), were used to 
prepare chimera B!*?-!”S1/M (Fig. 5A), which was then tested by 
transfection. Whereas B'**~!”°/M required two blind cell passages 
before the appearance of CPE (Fig. 2A), B'4?-'7°S1/M developed 
CPE within 24 h posttransfection (hpt). Moreover, progeny from 
VPO virus developed large plaques (Fig. 5B), had no additional 
putative suppressor mutations within the sequenced regions, had 
growth kinetics similar to that of wt MHV but with a 10-fold- 
lower final titer (10’ PFU/ml versus 10° PFU/ml for wt MHV) 
(Fig. 5C), and had a wt-like sgmRNA profile as determined by 
Northern analysis (Fig. 5D). 

These results, therefore, are consistent with the hypothesis that 
base pairing between the inter-stem-loop domain and the Nsp1 
coding domain contributes to the replicating ability and fitness of 
MAHV in cell culture. 

Preplaced suppressor mutations within the Nsp1 coding re- 
gion of the otherwise B'*”'7/M chimeric RNA hastened the ap- 
pearance of phenotypic revertants and suppressor mutations 
within the 5’-UTR-mapping inter-stem-loop domain. To deter- 
mine whether the rarer putative suppressor mutations within 
the Nsp1 coding region are sufficient for phenotypic reversion, the 
mutations A353U and A363G were separately placed into the 
MHV genome along with the wt BCoV inter-stem-loop domain 
and tested. With B'47-!7°49°Y/M RNA, CPE appeared within 48 
hpt and VP0 virus generated wt-like plaques, retained the A353U 
mutation, and also contained G169U within the transplanted do- 
main (data not shown). With B14?-17*49®S/M, CPE appeared 
within 48 hpt and VPO virus generated wt-like plaques, retained 
the A363G mutation, and also contained G169U within the trans- 
planted domain (data not shown). These results show that the 
A353U and A363G mutations each alone may not suffice as sup- 
pressor mutations but likely hastened the appearance of addi- 
tional suppressor mutations within the transplanted pyrimidine- 
rich domain. 

With the exception of the of a C30U mutation, single potential 
suppressor mutations occurring outside the long-range base-pair- 
ing windows, namely, C39U, U240C, and A591G, were not tested 
as suppressors (Table 1). When the C30U mutation alone was 
transplanted along with the BCoV inter-stem-loop domain into 
the MHV background, viable virus appeared, but so did C145 and 
G169 within the window. This result suggested that C30U may not 
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FIG 2 Phenotypes of representative plaque-purified progeny from cells electroporated with chimeric B'4*"'7?/M RNA. (A) Plaque appearances after 
culturing for 60 h on L2 cells. Isolated small plaques (S1 to S3 out of 6 total) and large plaques (L1 and L2 out of 11 total) were serially passaged 10 times 
on DBT cells. (B) Growth kinetics for three isolated small plaques (hatched lines) and two isolated large plaques (solid lines) following their selection at 


viral passages 1, 6, and 10. 


alone function as a suppressor but may by some unknown mech- 
anism hasten the appearance of suppressor mutations. This phe- 
nomenon was not explored further. 

The BCoV 5’ UTRand the (nearly) entire BCoV Nsp1 coding 
region function together in the MHV background to yield wt 
MHV-like progeny in cell culture. The evidence showing that a 
virus species-specific interaction between the inter-stem-loop do- 
main in the 5’ UTR and its partnering domain in the Nsp1 coding 
region is required for virus replication led us to directly test a 
bidirectional matching of the two domains. Since it had been 
shown previously that the functional BCoV SLIV alone in the 
MHV genome extends 16 nt into the Nsp1 coding region (22), the 
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full BCoV 5’ UTR in addition to the 16-nt extension (i.e., nt 1 to 
226) was used for testing with the remaining MHV Nsp1 coding 
region in the MHV background. For this, BY ?2o/M (Fig. 6A) was 
tested but was found to be nonviable over three blind cell passages 
from three separate electroporation attempts (Fig. 6B) (22), prob- 
ably because of the incompatibility between the inter-stem-loop 
domain (from BCoV) and the Nsp1 partnering domain (from 
MHV) described above. Likewise, no viable progeny were gener- 
ated when a chimera was tested in which the entire MHV 5' UTR 
plus its 16-nt extension into its Nsp1 coding region was used with 
the adapted BCoV Nsp1 coding region (i.e., one lacking the first 16 
nt and last 42 nt) in the MHV background to form B7””-°*/M (Fig. 


jviasm.org 4635 


AYVHEIT THIH HALSIT aN Aq 2102 ‘92 Jequiedeq Uo /Bo-wseIAl//:dyy Wol) papeojuUMog 


Guan et al. 


TABLE 1 Potential compensatory mutations within the 5’ UTR and 
Nsp1 coding region of progeny from the B‘**"'77/M chimera 
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* Plaque purified clones were selected at virus passage 0 (VPO) and identified as small 
(S) or large (L). 

’ Number of base pairs within the base-pairing window (identified as positions 145 to 
169 in the 5’ UTR and 347 to 363 in the Nsp1 cistron). 

© See Fig. 3 for descriptions of the folding patterns. 
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6A and B). Note that for this construct, the MHV 3’ 42 nt of the 
Nsp1 coding region were retained to keep the region compatible 
with the MHV papain-like protease that separates Nsp1 and Nsp2 
(17). However, when a construct in which the BCoV 5’ UTR con- 
taining the 30-nt MHV inter-stem-loop domain was tested in the 
MHV background as in B‘"'*"’*?°/M (Fig. 6A), CPE appeared 
within 48 hpt, and progeny from electroporated RNA (VP0) 
yielded medium plaques, a peak titer of 1.0 X 10° PFU/ml, anda 
full set of viral ssmRNAs (Fig. 6B to D). These results together 
demonstrated a requirement for compatibility between the 5’- 
UTR-mapping inter-stem-loop domain and its cognate partner- 
ing domain within the Nsp1 coding region. Conversely, when a 
construct with the entire MHV 5’ UTR but with the BCoV 32-nt 
inter-stem-loop domain and the BCoV Nsp1 coding region in the 
MHYV background, Borate 8 (Fig. 6A), was tested, CPE 
was observed within 48 hpt, and viable progeny (VP0) producing 
large plaques, a titer of 1.0 X 10° PFU/ml, and a full set of viral 
sgmRNAs (Fig. 6B to D) were obtained. This construct, therefore, 
also demonstrated a compatibility requirement between the 5’ 
UTR-mapping inter-stem-loop domain and its cognate partner- 
ing domain within the Nsp1 coding region. 

These results are fully consistent with the notion that func- 
tional base pairing between the two domains is needed for virus 
replication. Still, the compatibility in these constructs was not 
perfect, since the titer of progeny was 100-fold lower than that for 
wt MHV. Interestingly, when the entire BCoV 5’-proximal region 
of the genome through the Nsp1 coding region to nt 906 was 
tested in construct B!°°/M, CPE was found within 24 hpt and 
viral progeny produced plaques, growth kinetics, a final titer, and 
a sgmRNA profile that were indistinguishable from those of wt 
MHV (Fig. 6A to D). These results further demonstrate a require- 
ment for compatibility between the 5’ UTR and cognate Nsp1l 
coding region for BCoV and MHV when used in the MHV back- 
ground for replication and fitness studies in cell culture. Without 
a reverse genetics system for BCoV, we could not apply the recip- 
rocal test for this finding. 

The 30-nt inter-stem-loop domain in the 5’ UTR is not an 
absolute requirement for MHV replication in cell culture. Since 
in MHV, distinct regions in the 5’ UTR, including SLIII and the 
loop region of SLIV (Fig. 1), and a large region of the 3’ UTR have 
been shown to be dispensable for virus replication in cell culture 
(20, 22, 50), this test was applied to the 30-nt inter-stem-loop 
domain in MHV. Surprisingly, a deletion of nt 141 to 170 in the 
MHV genome (Fig. 7A) was not lethal, but rather, CPE appeared 
after one blind cell passage following transfection of the mutated 
genomic transcript, and progeny showing wt-like large plaques 
and a final growth titer of 1.0 X 10° PFU/ml were obtained (Fig. 
7B and C). Interestingly, an A649C mutation (H147P) was found 
in the Nsp1 coding region of all progeny, and G516A (V103I), 
C583U (T1251), and A880C (E224A) mutations were found in 
some but not all progeny at VP10 (Fig. 7C). Furthermore, a 
UCUAA sequence was found inserted within the upstream tran- 
scription-regulatory sequence of all progeny by VP10 (Fig. 7A and 
C). These results together with the observation of very poor viral 
growth when the BCoV inter-stem-loop domain replaced the ho- 
mologous native MHV domain demonstrate that the inter-stem- 
loop domain is required for optimal viral fitness but is not abso- 
lutely required for viral replication in cell culture. 
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FIG 3 Potential suppressor mutation-induced base pairings in progeny of the B'4?-!”*/M chimera. (A) Summary of potential suppressor mutations and their 
positions within the inter-stem-loop and downstream partnering domains that appeared in the small and large plaque isolates after 10 serial passages (depicted 
as in Fig. 1C). (Top) Base positions (in parentheses) for the BCoV sequence (lowercase) use BCoV numbering. Base positions elsewhere use MHV numbering. 
(Bottom) Summary of base changes at the named positions. The subscript numbers indicate the number of occurrences in 17 total isolates, shown in Table 1. (B) 
Three potential folding patterns as suggested by Mfold that together maximize the base pairings enabled by the suppressor mutations. Any of the three potential 
foldings can accommodate revertants $2, L2, L3, L5, L6, L7, L8, and L11, and others are accommodated as indicated. For revertants depicted in Table 1, base 
pairings within the base-pairing window at passage 10 increased from 19 to 22, 23, 24, or 25 (as summarized in the last two columns of Table 1). 


DISCUSSION 


Here, we report the discovery that for the bovine coronavirus and 
mouse hepatitis coronavirus, separate species in the (formerly 
group 2a) betacoronavirus genus, there exists a functional inter- 
dependence between their respective 5' UTRs and Nsp1 coding 
regions for optimal virus growth in cell culture and that part of 
that interdependence is a long-range (~200-nt) RNA-RNA inter- 
action. Furthermore, it was observed that whereas only some parts 
of the BCoV 5’ UTR and Nsp1 coding region can be transplanted 
into the MHV genome to directly yield viable virus and some parts 
cannot (22), when the entire BCoV 5’ UTR and Nsp1 coding 
region are transplanted together as a cognate unit, nearly fully fit 
MHV wt-like progeny are directly obtained, and there appear to 
be no interspecies constraints on replication. These results there- 
fore show that the BCoV cis-acting RNA elements in the 5’ UTR 
and Nsp1 coding region, although evolutionarily divergent 
(~36% nucleotide divergence in the 5' UTR, ~40% nucleotide 
divergence in the Nsp1 coding region, and ~53% amino acid 
divergence in the Nsp1 protein), are successfully recognized as a 
cognate unit and utilized by the MHV proteins for genome repli- 
cation and packaging with apparently no detrimental effect. The 
long-range RNA-RNA interaction function for replication also 
explains, at least in part, the previously reported cis-replication 
requirement for the 5'-terminal part of the Nsp1 coding region in 
BCoV defective interfering (DI) RNA (7, 8) and the presence of 
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this region in all known naturally occurring group 1 and group 2 
coronavirus DI RNAs (3, 4). They possibly also explain, in part, 
the requirement for this region in the MHV genome for virus 
replication (5). 

Is the identified long-range RNA-RNA interaction a com- 
mon feature of coronaviruses? In a recent comprehensive pre- 
diction of coronavirus 5’-end-proximal higher-order struc- 
tures based predominantly on Mfold and phylogenetic and 
covariation analyses by Chen and Olsthoorn (10), seven sub- 
stantial stem-loops (1 through 5, and unnamed 6 and 7) were 
described for the 5’-terminal 340 nt of group 2a coronaviruses 
that, for BCoV, are essentially identical to the seven stem-loops 
depicted in Fig. 1B (SL1, -2, -IL, -IH, and -IV) and in references 
23 (SLV) and 7 (SLVI) (sequence differences are due to the 
different strains of BCoV used, BCoV-Mebus versus BCoV- 
Ent). It is of interest that for both BCoV and MHV, stem-loops 
V and VI (equivalent to stem-loops 6 and 7 in reference 10; also 
data not shown) map entirely within the Nsp1 coding region 
and (except for possibly a short region as noted below) lie 
between the upstream and downstream partnering domains in 
the long-range RNA-RNA interaction. A possibly analogous 
long-range RNA-RNA interaction is predicted for each of the 
group 2b, 2c, and 2d coronaviruses as well, but in these cases, in 
contrast to the group 2a coronaviruses, the long-distance up- 
stream and downstream base-pairing partners are contained 
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FIG 4 Base pairings between the inter-stem-loop and partnering domains for 10 betacoronaviruses. (A) Alignment of the inter-stem-loop domains for 
MHV-like, HCoV-HKU1, and BCoV-like betacoronaviruses. Shading identifies bases that are in common with MHV-A59. The inter-stem-loop domains are 
identified with a hatched box. (B) Base pairings as predicted by Mfold. Shaded areas identify regions of base pairing common to the whole group. Underlined 
bases within the shaded areas are variant bases relative to those in MHV-A59 that maintain base pairing at the same position. Note that the number of base 
pairings within the inter-stem-loop domain for any given virus ranges from 25 to 27. (C) Potential 5’-end-proximal long-range RNA-RNA interaction derived 
by Mfold analysis of the 5’ region of the SCoV-Tor2 genome. This folding prediction is identical to that named for stem-loop 5 by Chen and Olsthoorn (10). The 


AUG start codon for the Nsp1 coding region is identified with a hatched box. 


entirely within the stem of SL5 (equivalent to SLIV in Fig. 1; see 
also Fig. 4C, and see Fig. 3B in reference 10). SL5 for groups 2b, 
2c, and 2d coronaviruses also harbors a large terminal loop 
that, in turn, contains three smaller stem-loops with primarily 
hexameric (UUYCGU) terminal loops (10). That is, for the group 
2b SCoV-like coronaviruses (harboring an 80-nt terminal loop), 
the group 2c BtCoV-HKU5-1-like coronaviruses (harboring a 
94-nt terminal loop), and the group 2d BtCoV-HKU9-1-like 
coronaviruses (harboring a 55-nt terminal loop), there is a long- 
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range RNA-RNA interaction, but the upstream portion is nearly 
entirely within the 5’ UTR and the downstream portion is con- 
fined to the 5’-terminal 29 to 113 nt of the Nsp1 coding region. In 
a pattern similar to that of the group 2b through 2d coronaviruses, 
long-range RNA-RNA interactions that might also be analogous 
are predicted within SL5 of the group la TGEV-Purdue-like coro- 
naviruses (with a 106-nt terminal loop), the group 1b HCoV- 
229E-inf-1-like coronaviruses (with a 100-nt terminal loop), the 
group lc PEDV-CV777-like coronaviruses (with a 116-nt termi- 
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FIG 5 Testing of B'4*-!7351/M chimeric RNA for replication. (A) B'4?-!73/M RNA was made to contain C158G, A163U, and G169U (*) in order to mimic 
reverted virus S1 at virus passage 10. The predicted base-pairing pattern of B'4?-"”*$"/M long-range RNA-RNA interaction is shown. BCoV sequence is indicated 
by lowercase letters. (B) Plaque patterns of wt MHV and B'*?-!”>51/M at VP1. (C) Growth kinetics of wt MHV-A59 and B'4?"!7351/M at VP1. (D) Northern blot 
showing genomic and sgmRNA patterns for uninfected cells (lane 1), wt-MHV-infected cells (lane 2), and B?’7*‘1/M virus-infected cells (lane 3). The 
radiolabeled probe was an oligonucleotide complementary to a 26-nt region within the MHV 3’ UTR. 


nal loop), and the group 1d GId BtCoV-1A-like coronaviruses 
(with a 114-nt terminal loop). That is, in the group 1 coronavi- 
ruses, the upstream portion of the long-range interaction and the 
terminal loop map entirely within the 5’ UTR and the down- 
stream portion is confined to the 5’-terminal 9 to 13 nt of the Nsp1 
coding region (10). Intriguingly, three small stem-loops contain- 
ing mostly the VUUYCGU loop, like those in the group 2b through 
2d coronaviruses, make up the large terminal loop of SL5 in the 
group | coronaviruses (10, 42). Within the group 3 coronaviruses, 
no base pairing between the 5’ UTR and open reading frame 1 
coding region was noted (10). For coronaviruses other than MHV 
and BCoV, the functional significance of the specific long-range 
RNA-RNA interaction, to our knowledge, has not been experi- 
mentally examined; however, the large terminal loop in SL5 of the 
group 2b through 2d and group 1 coronaviruses is a postulated 
genome packaging signal (10, 18). 

One interesting feature of the BCoV 72-nt SLV and 30-nt 
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SLVI that map within the loop of the long-range RNA-RNA 
interaction is that each functions as a higher-order cis-replica- 
tion element for DI RNA in virus-infected cells based on anal- 
ysis of synonymous structure-disrupting and compensatory 
structure-restoring mutations (7, 23). In experiments concur- 
rent with those described above, attempts were made to analyze 
the cis-acting role of MHV SLV and SLVI in viral replication. 
For this, however, structure-disrupting synonymous muta- 
tions were found available in only the MHV 83-nt SLV (nt 233 
through 315) in which mutations C242U and C239U as a pair 
in the left strand and G305A and G308A as a pair in the right 
strand were separately made and tested. These mutations al- 
tered SLV AG from —32.5 kcal/mol to —26.7 and —26.9 kcal/ 
mol, respectively, but resulted in no changes in progeny plaque 
size or replication kinetics (data not shown). Hence, the poten- 
tial role of MHV SLV and SLVI in virus replication has not yet 
been determined. In a retrospective analysis relevant to the 
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FIG 7 Deletion of the 30-nt inter-stem-loop domain in MHV leads to viable 
but debilitated progeny. (A) Schematic depiction of the MHV 5’ UTR with a 
deleted inter-stem-loop region (A141-170) and the inserted UCUAA element. 
The UCUAA insertion could theoretically have occurred following nt 59, 64, 
or 69 (illustrated as the latter). (B) Plaque patterns for plaque clones | to 3 at 
virus passages 1 and 10. (C) Genotypes and virus titers for the viruses depicted 
in panel B. 


long-range RNA-RNA interaction, however, we noted that the 
downstream 9 nt of SLVI in BCoV (nt 332 to 340) (7) are also 
part of the downstream base-pairing window in the long-range 
RNA-RNA interaction (Fig. 1B). That is, the mutations that 
destabilized the lower stem of SLVI in BCoV DI RNA (C333G 
and C336U) (7) would also have decreased base pairing in the 
long-range RNA-RNA interaction from 25 to 24 (Fig. 1B). 


Coronavirus 5’ Long-Range RNA-RNA Interaction 


Thus, either disruption (i.e., that in SLVI or that in the long- 
range RNA-RNA interaction) may have been the lesion that 
blocked DI RNA replication. This arrangement may suggest a 
possible functional switching between two alternate structures. 
A similar arrangement involving the four 3’-terminal nucleo- 
tides in MHV SLVI (nt 332 to 335) is also possible (data not 
shown). 

Inasmuch as the 5’ UTR-mapping 30-nt inter-stem-loop 
component of the long-range RNA-RNA interaction can be 
deleted without killing MHV replication (Fig. 7), the question 
of what role(s) the long-range RNA-RNA interaction might 
play that could be considered optional for virus survival arises. 
Currently, we can only speculate from known functions of 5’- 
end-proximal cis-acting replication elements in coronaviruses 
and other plus-strand RNA viruses (see the introduction). 
Based on available data, it is plausible that the long-range RNA- 
RNA interaction plays a role in one or both of two potential 
regulatory pathways. First, it might regulate an optional non- 
canonical translation initiation site in a manner that ensures 
viral genome translation under conditions of cell stress when 
canonical translation initiation mechanisms are largely re- 
pressed (1, 26, 30, 44). This role is suggested by the demonstra- 
tion of a potential internal ribosomal entry site on group 3 
coronavirus genomic RNA that needs yet to be demonstrated in 
other coronavirus groups (34) and by robust translation of 
MHV RNAs during global inhibition of cellular mRNA trans- 
lation (2, 40, 45). Such an alternate mechanism of translation 
initiation has been shown to function on some capped and 
polyadenylated viral and cellular mRNAs, to be influenced by 
double-stranded RNA regions near the translation start codon, 
and to be under the control of trans-acting proteins (1, 26, 
30, 44). 

A second potential regulatory pathway for the long-range 
RNA-RNA interaction could be maintenance ofa proper acceptor 
orientation for the RNA-dependent RNA polymerase (RdRp) 
template switching during generation of minus-strand templates 
for subgenomic mRNA synthesis (43, 49, 55). This is suggested by 
the appearance of the UCUAA component within the UCUAAAC 
intergenic transcriptional regulatory sequence following deletion 
of the 30-nt inter-stem-loop region (Fig. 7). Addition of the 
UCUAA element in the same region was also found when stem- 
loop 4 (referred to as SLIT in Fig. 1A) was deleted, which led to the 
suggestion that stem-loop 4 functions as a required spacer element 
for sgmRNA synthesis (50). In earlier studies, the UCUAA ele- 
ment was also found to spontaneously appear at this site in some 
MHV strains in which its occurrence positively correlated with 
enhanced transcription (37, 51, 53). Regulation of mRNA synthe- 
sis is also suggested by the increased synthesis of ssmRNAs when 
amino acid-changing mutations were made in the (then un- 
known) downstream partnering domain of the MHV long-range 
interaction during the analysis of Nsp1 function by Brockway and 
Denison (5). For Nsp1 functional analysis in MHV-A59, 13 amino 


FIG 6 Requirement for compatibility between the 5’-UTR-located inter-stem-loop and downstream partnering domains for MHV replication. (A) For viability, 
the chimeric virus must contain the MHV inter-stem-loop domain within the BCoV 5’ UTR when in the MHV background (as in B''4""!7*7°/M), the BCoV 
inter-stem-loop domain and BCoV Nsp1 coding region when in the MHV background (as in B'4?-?”*,?27-°°°/M), or the BCoV 5’ UTR and BCoV Nsp1 coding 
region when in the MHV background (as in B'°°*/M). Note that the in-frame C-terminal fusion site for the Nsp1 coding region occurs between BCoV nt 906 
and MHV nt 909. (B) Plaque phenotypes for the progeny from the variant constructs described for panel A. (C) Growth kinetics of the progeny depicted in B. (D) 


Northern analyses of the progeny depicted in for panel B. 
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acids within the amino-terminal third of Nsp1 were changed to 
alanine by reverse genetics and tested for viability. Although only 
some of the mutations were found to be lethal, two tested simul- 
taneously, E46A and K48A, made by A346C, A351G, and A352C 
mutations, were not lethal but inexplicably caused an ~20% in- 
crease in overall viral RNA synthesis (5). Although these three 
mutations which map within the long-range base-pairing window 
would have decreased the base pairings only within the window 
from 27 to 25, there might have been enough RNA rearrangement 
to affect upstream RdRp template switching and an increase in 
subgenomic mRNA synthesis. Alternatively, the amino acid 
changes could have caused alternate template switching behavior 
by an entirely different unknown mechanism. 

From the present study, there is no clear evidence to suggest 
that MHV behavior was altered as a function of amino acid 
changes arising from suppressor mutations. However, in no case 
was an amino acid-changing mutation found in the absence of 
other potential suppressor mutations, making it unfeasible to as- 
sess the effects of single amino acid changes alone. Of the five 
suppressor mutations within the downstream Nsp1 coding re- 
gion, four (S3, $4, L2, and L5) led to amino acid changes within 
Nsp1 (K52E, P47T, K48N, and K50T, respectively) (Table 1), but 
progeny from each of these made large plaques (Fig. 2A and data 
not shown), and two, S3 and L2, had near-wt growth kinetics at 
passage 10 (Fig. 2B). No evaluation of viral RNA levels was made 
for any of these, however. 
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